10/590524 

IAP12Rec , dPCT/PT024 AUG 2006 



DESCRIPTION 

MOTION COMPENSATION METHOD 

Technical Field 

5 The present invention relates to a motion compensation 

method for interpolating sub-pixels into a reference picture and for 
performing motion compensation based on the interpolated 
reference picture. 

10 Background Art 

Moving pictures are being adopted in an increasingly number 
of applications ranging form video telephony and video conferencing 
to DVD and digital television. When moving pictures are 
transmitted, a substantial amount of data has to be sent through 

15 conventional transmission channels of a limited available frequency 
bandwidth. In order to transmit the digital data through the limited 
channel bandwidth, it is inevitable to compress or reduce the volume 
of the transmission data. 

In order to enable inter-operability between systems 

20 designed by different manufactures of any given application, 
video-coding standards have been developed for compressing the 
amount of video data. The coding approach underlying most of 
these standards consist of the following main steps: 

(1) Dividing each video frame into blocks of pixels so that 
25 processing of the video frame can be conducted at a block level; 

(2) Reducing spatial redundancies within a video frame by 
subjecting video data of each block to transform, quantization and 
entropy coding; 

(3) Exploiting temporal dependencies between blocks of 
30 subsequent frames in order to only transmit differentials between 

subsequent frames. 

Temporal dependencies between blocks of subsequent frames 
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ATTACHMENT "A" 



are determined by employing a motion estimation and compensation 
technique. For any given block, a search is performed in previously 
coded and transmitted frames to determine a motion vector which 
will be used by the coding apparatus and the decoding apparatus to 
5 predict the image data of a block. 

An example configuration of a video coding apparatus is 
illustrated in FIG. 1. The shown video coding apparatus generally 
denoted with reference numeral 900 includes: a 
transform/quantization unit 920 to output quantized transform 

10 coefficients QC by transforming spatial image data to the frequency 
domain and quantizing the transformed image data; an entropy 
coding unit 990 for performing entropy coding (variable length 
coding) of the quantized transform coefficients QC and outputting 
the bit stream BS; and a video buffer (not shown) for adopting the 

15 compressed video data having a variable bit rate to a transmission 
channel which may have a fixed bit rate. 

The coding apparatus shown in FIG. 1 employs a DPCM 
(Differential Pulse Code Modulation) by only transmitting 
differentials between subsequent fields or frames. A subtractor 

20 910 obtains these differentials by receiving the video data to be 
coded as an input signal IS and subtracting the previous image 
indicated by a prediction signal PS therefrom. The previous image 
is obtained by decoding the previously coded image. This is 
accomplished by a decoding apparatus which is incorporated into 

25 video coding apparatus 900. The decoding apparatus performs the 
coding steps in a reverse manner. More specifically, the decoding 
apparatus includes: an inverse quantization/transform unit 930, and 
an adder 935 for adding the decoded differential (differential 
decoding signal DDS) to the previously decoded picture (prediction 

30 signal PS) in order to produce the image as will be obtained on the 
decoding side. 

In motion compensated DPCM, a current frame or field is 
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predicted from image data of a previous frame or field based on an 
estimation of the motion between the current and the previous 
images. Such estimated motion may be described in terms of 
2-dimensional motion vectors representing the displacement of 
5 pixels between the previous and the current images. Usually, 
motion estimation is performed on a block-by-block basis. An 
example of the division of the current image into plurality of blocks 
is illustrated in FIG. 2. 

During motion estimation, a block of a current frame is 

10 compared with blocks in previous frames until a best match is 
determined. Based on the comparison results, an inter-frame 
displacement vector for the whole block can be estimated for the 
current frame. For this purpose, a motion estimation unit 970 is 
incorporated into the coding apparatus together with the 

15 corresponding motion compensation unit 960 included into the 
decoding path. 

The video coding apparatus 900 of FIG. 1 performs operations 
as follows. A given video image indicated by an input signal IS is 
divided into a number of small blocks, usually denoted as "macro 
20 blocks". For example, video image shown in FIG. 2 is divided into a 
plurality of macro blocks, each of which usually having a size of 16 
x 16 pixels. 

When coding the video data of an image by only reducing 
spatial redundancies within the image, the resulting frame is 

25 referred to as an I-picture. I-pictures are typically coded by 
directly applying the transform to the macro blocks of a frame. 
I-pictures are large in size as no temporal information is exploited to 
reduce the amount of data. 

In order to take advantage of temporal redundancies that 

30 exist between successive images, a prediction coding between 
subsequent fields or frames is performed based on motion 
estimation and compensation. When a selected reference frame in 
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motion estimation is a previously coded frame, the frame to be 
coded is referred to as a P-picture. In case both, a previously 
coded frame and a future frame are chosen as reference frames, the 
frame to be coded is referred to as B-picture. 
5 Although the motion compensation has been described to be 

based on a 16 x 16 macro block, motion estimation and 
compensation can be performed using a number of different block 
sizes. Individual motion vectors may be determined for blocks 
having 4x4, 4x8, 8x4, 8x8, 8x1 6, 16x8, or 16x16 pixels. The 

io provision of small motion compensation blocks improves the ability 
to handle fine motion details. 

Based on the results of the motion estimation operation, the 
motion compensation operation provides a prediction based on the 
determined motion vector. The information contained in a 

15 prediction error block resulting from the predicted block is then 
transformed into transform coefficients in transform/quantization 
unit 920. Generally, a 2-dimensional DCT (Discrete Cosine 
Transform) is employed. The resulting transform coefficients are 
quantized and finally entropy coded (VLC) in entropy coding unit 

20 990. 

A decoding apparatus receives the transmitted bit stream BS 
of compressed video data and reproduces a sequence of coded video 
images based on the received data. The configuration of the 
decoding apparatus corresponds to that of the decoding apparatus 

25 included in the coding apparatus shown in FIG. 1. A detailed 
description of the configuration of the decoding apparatus is 
therefore omitted. 

In order to improve the accuracy of motion compensation, a 
sub-pixel accuracy of reference frames is widely used. For example, 

30 1/2 sub-pixel accuracy motion compensation is used in the MPEG-2 
format. 

In order to further increase the motion vector accuracy and 
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coding efficiency, a 1/3 and a 1/6 sub-pixel vector accuracies have 
been proposed in Patent Literature EP 1 073 276. 

The motion vector accuracy and coding efficiency can further 
be increased by applying interpolation filters in motion estimation 
5 and compensation yielding 1/8 sub-pixel displacements. However, 
such a sub-pixel resolution requires high computation complexity, in 
particular, calculation registers having a length of up to 25 bits. 

Such a complex implementation may be based on a 2-step 
approach. In the first step a 1/4 sub-pixel image employing an 
10 8-tap filter is calculated. In second step a 1/8 sub-pixel is obtained 
based on the 1/4 sub-pixel image by employing a bilinear filtering. 

The filtering operation for generating the image with the 1/4 
sub-pixel accuracy includes the steps of horizontal and subsequent 
vertical filtering. The horizontal interpolation may be performed 
15 based on the following Equations (1) to (3): 



\ =-3-A 4 +12- B A -37C 4 +229 -At + 71£ 4 -21F 4 + 6G 4 -1-// 4 ...(1) 
h 2 = -3A 4 +l2B 4 -39 C 4 +158- £> 4 + 158£ 4 -39-F 4 +12 G 4 -3 // 4 ...(2) 
/*3 =-1 A 4 +6 5 4 -21C 4 + 71£> 4 + 229 £ 4 -37-F 4 +12 G 4 -3-// 4 ...(3) 



In the above equation, hi to h 3 denote the 1/4 sub-pixel 
20 values and A x -H x represent the original full-pel pixel values, namely, 
the pixels from the original image. 

Coefficients applied to the above A x — H x are set in a way that 
the signal processing is performed preventing the occurrence of 
imaging by upsampling, in other words, unnecessary high frequency 
25 components generated through interpolation are eliminated. 

The horizontal filtering is illustrated in FIG. 3. Eight-tap 
filtering is performed based on the pixel values of the original pixels 
210 and the pixel values of the three intermediate pixels 220 are 
calculated in order to obtain a 1/4 sub-pixel accuracy in the 
30 horizontal direction. 
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After the horizontal filtering has been completed, the 
resulting image data having a full-pel pixel accuracy in the vertical 
direction and a 1/4 sub-pixel accuracy in the horizontal direction are 
subjected to vertical filtering. For this purpose, the following 
5 Equations (4) to (6) having coefficients which correspond to those of 
the above described horizontal filter are employed. 

v, =-3 D, + 12-D 2 -37-D 3 +229 A, +71-Z) 5 -21 D 6 + 6Z> 7 -1-D 8 ...(4) 
v 2 = -3D, +12- D 2 -39 D 3 + 158D 4 + 158 -Z> 5 ~39D 6 + 12£> 7 -3-D 8 ...(5) 
v 3 =-1-/), + 6£> 2 -21D 3 + 71 A, + 229 D 5 -37Z) 6 +12 • D 7 - 3 • Z) 8 ...(6) 

10 In the above equations, vi to v 3 refer to the calculated vertical 

1/4 sub-pixel values and D x , D 2 , D 3/ D 4 , D 5 , D 6 , D 7 and D 8 represent 
the full-pel resolution pixels , namely, the pixel values of the original 
pixels 210. 

Like in the case described above, coefficients applied to D x are 
15 set in a way that the signal processing is performed preventing the 
occurrence of imaging by upsampling, in other words, unnecessary 
high frequency components generated through interpolation are 
eliminated. 

The resulting pixel values have a length of up to 25 bits. In 
20 order to obtain image data in each of the pixel values fall into a 
predefined range of allowable pixel values, the calculation results 
are downshifted and rounded as illustrated. An example case of 
pixel value Vi is shown by the following Equation (7): 



25 



256 2 



v,+^- »16...(7) 



Here, Vi represents the pixel value resulting from the 
horizontal and vertical filtering, while Vi' represents the downshifted 
pixel value. The downshifted pixel values are further clipped to the 
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range of 0 to 255. 

The vertical filtering is illustrated in FIG. 4. The pixel values 
of the pixels 230 obtained during vertical filtering complete the 
sub-pixel array illustrated by way of filtering example between 
5 original pixels D 4 , D 5/ E 4 and E 5 . 

After having the 1/4 sub-pixel image completed, a 1/8 
sub-pixel frame is calculated by applying a bilinear filtering to the 
1/4 sub-pixel resolution. In this manner, intermediate pixels are 
generated between each of the 1/4 resolution pixels. 

10 A bilinear filtering is applied in two steps and is illustrated by 

way of examples in FIG. 5 and FIG. 6. Starting from the 1/4 
sub-pixel resolution, FIG. 5 illustrates the application of a horizontal 
and vertical filtering. For this purpose, a mean value is calculated 
from the respective neighbouring pixel values in order to obtain an 

15 intermediate pixel value of a 1/8 sub-pixel resolution. When 
employing a binary representation for this processing, the following 
Equation (8) can be applied. Note that in Equation (8) 

represents 1-bit downshifting. 

20 A = (B + C + l)»l..XS) 

The remaining 1/8 sub-pixel values to be interpolated are 
calculated by diagonal filtering as illustrated in FIG. 6. It is a 
particular advantage of this approach that, in the bilinear filtering, 

25 the number of sub-pixel values stemming from multiple filtering is 
minimized as much as possible. For this purpose, it is preferable 
that only those pixel values, of the interpolated pixels, that are 
directly derived from original pixel values 210 are taken into account. 
In other words, those derived pixel values are the pixel values of the 

30 interpolated pixels located between the original pixels. 

All intermediate pixel values can be calculated therefrom, in 
other words, from the pixel values of the original pixels 210 and the 
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intermediate pixel values derived from the original pixel values, 
when additionally taking center pixel 240 of the sub-pixel array into 
account. The calculation operation for the additional 1/8 sub-pixel 
values is based on two of the 1/4 sub-pixel resolution values, 
5 respectively. The individual pixel values taken into account for the 
calculation of an intermediate pixel value are illustrated in FIG. 6 by 
respective arrows. Each of the arrows shows two pixel values of 
pixels based on which each intermediate pixel value of the two is 
calculated. Depending on the distance of the pixels to be taken into 
10 account for interpolation, the following Equations (9) and (10) are 
employed: 

D = (£ + F + l)»l...(9) 
G = (3// + / + 2)»l...(10) 

15 In the above equations, D and G represent new intermediate 

pixel values as illustrated in FIG. 6, and E, F, H and I represent the 
pixel values obtained from the 1/4 resolution image. The additional 
values of "1" and "2" in the above equations only serve for correctly 
rounding the calculation results. 

20 However, the above-described conventional motion 

compensation method requires to record a long operation value of 
25 bits in the filtering process for 1/4 sub-pixel interpolation. This 
causes a particular disadvantage of such an interpolation approach 
that long registers are needed resulting in high hardware complexity 

25 and computational effort. 

The present invention is conceived in view of this drawback. 
An object of the present invention is to provide a motion 
compensation method for reducing operational workload and 
simplifying a hardware configuration. 

30 

Disclosure of Invention 
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In order to achieve the above-described object, the motion 
compensation method of the present invention includes: 
interpolating sub-pixels in a reference picture; and performing 
motion compensation based on the interpolated reference picture, in 
5 the method, the interpolating including: a first calculation step of 
calculating base values which are bases of sub-pixel values of the 
sub-pixels by multiplying coefficients with pixel values of pixels 
included in the reference picture; and a first rounding step of 
deriving the sub-pixel values of the sub-pixels by rounding the base 

10 values calculated in the first calculation step instead of directly 
using the base values in calculating sub-pixel values of other 
sub-pixels; and the performing of motion compensation includes 
performing motion compensation based on the reference picture 
having the interpolated sub-pixels with the correspondingly derived 

15 sub-pixel values. 

For example, in the conventional method, base values of 
sub-pixels that have been calculated are directly used in calculating 
sub-pixel values of other sub-pixels. However, in the present 
invention, the base values of sub-pixels that have been calculated in 

20 the first calculation step are rounded in stead of being directly used 
in calculating the sub-pixel values of other sub-pixel values. 
Therefore, even in the case where the sub-pixel values of the other 
sub-pixels are calculated using the base values rounded, the 
number of bits to be used in the calculation can be more reduced 

25 than in the conventional way. As a result, it becomes possible to 
reduce the operational workload and to simplify the hardware 
configuration. 

Also, in a first aspect of the present invention, in the motion 
compensation method, the first calculation step may include 
30 calculating base values of sub-pixels to be interpolated in a first 
direction, and the first rounding step may include deriving sub-pixel 
values of the sub-pixels to be interpolated in the first direction by 
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rounding the base values calculated in the first calculation step. At 
this time, in a second aspect of the present invention, in the motion 
compensation method, the interpolation may further include: a 
second calculation step of calculating, using the sub-pixel values of 
5 the sub-pixels derived in the first rounding step, base values of 
sub-pixels to be interpolated in a second direction that is different 
from the first direction; and a second rounding step of deriving the 
sub-pixel values of the sub-pixels to be interpolated in the second 
direction by rounding the base values calculated in the second 

10 calculation step. 

In this way, in the process of calculating sub-pixel values of 
sub-pixels to be interpolated in the first direction and in the second 
direction, the number of bits to be used in the calculation can be 
reduced down to 16 bits from, for example, 25 bits needed in a 

15 conventional way. 

Also, a fourth aspect of the present invention, in the motion 
compensation method, the first calculation step may include 
calculating the base values of three a-fourths sub-pixels using the 
following equations when eight pixel values of pixels arrayed in the 

20 first direction are represented as A, B, C, D, E, F, G and H 
respectively and the three a-fourths sub-pixel values are 
represented as hi, h 2 and h 3 respectively: 

h, =-l^ + 3.5-10.C + 59-D + 18-£-6-F + l-G-0-//; 
/2 2 =-l^ + 4^-10.C + 39-D + 39.£-10^ + 4-G-l-//;^ 
h 3 =-0^ + l-5-6-C + 18-Z) + 59-£-10-f + 3-G-l'//. 

Here, in a fifth aspect of the present invention, in the motion 
25 compensation method, the second calculation step may include 
calculating the base values of three a-fourths sub-pixels using the 
following equations when eight pixel values of pixels arrayed in the 
second direction are represented as Di, D 2 , D 3 , D 4 , D 5/ D 6/ D 7 and D 8 
respectively and the three a-fourths sub-pixel values are 
30 represented as Vi, v 2 and v 3 respectively: 
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v,=-3-/) I +12-i) 2 -37-Z)3 +229-D 4 +71.Z) 5 -21- J D 6 +6-i) 7 -l. J D 8 ; 

v 2 =-3-D ) +12'£) 2 -39-Z)3 + 158£> 4 + 158£> 5 - 39 • £> 6 + 12 • £> 7 -3D 8 ;and 

v 3 = — 1 + 6D 2 -21-£> 3 +71-£> 4 +229-£> 5 - 37 - Z) 6 + 12 • D 7 -3 • D B . 

In this way, the coefficients used in calculating sub-pixel 
values of sub-pixels are smaller than the conventional coefficients. 
This makes it possible to further reduce the number of bits to be 
5 used in calculating the sub-pixel values. 

Also, in the fourth aspect of the present invention, the motion 
compensation method may further include a bilinear filtering of 
raising a sub-pixel accuracy by applying bilinear filtering to the 
reference picture having the interpolated sub-pixels with the 
10 correspondingly derived sub-pixel values. 

In this way, the increase in sub-pixel accuracy makes it 
possible to prevent picture quality from deteriorating during the 
picture coding processing and the picture decoding processing. 

Note that the present invention can be realized as a motion 
15 compensation method, a motion estimation method, a moving 
picture coding method and a moving picture decoding method using 
the motion compensation method, a program causing a computer to 
execute these steps of the respective methods, a recording medium 
for storing the program, and an apparatus for performing operations 
20 according to these methods. 

Further Information about Technical Background to this 
Application 

The disclosure of EP Application No. 04016437.8 filed on July 
25 13th, 2004 including specification, drawings and claims is 
incorporated herein by reference in its entirety. 

Brief Description of Drawings 

These and other objects, advantages and features of the 
30 invention will become apparent from the following description 
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thereof taken in conjunction with the accompanying drawings that 
illustrate a specific embodiment of the invention. In the Drawings: 
FIG. 1 is a block diagram showing the structure of a moving 
picture coding apparatus; 
5 FIG. 2 is an illustration of how a video image is divided into 

blocks; 

FIG. 3 is an illustration of horizontal filtering for calculating a 
1/4 sub-pixel accuracy in the horizontal direction; 

FIG. 4 is an illustration of vertical filtering for calculating a 
10 1/4 sub-pixel accuracy in the vertical direction; 

FIG. 5 is an illustration of horizontal and vertical filtering for 
calculating a 1/8 sub-pixel accuracy; 

FIG. 6 is an illustration of bilinear filtering in the diagonal 
direction for calculating a 1/8 sub-pixel accuracy; 
15 FIG. 7 is a block diagram showing the configuration of a 

moving picture coding apparatus in the embodiment of the present 
invention; 

FIG. 8 is a flow chart showing the motion compensation 
operation performed by the moving picture coding apparatus in the 
20 embodiment; 

FIG. 9 is a comparison graph illustrating the difference 
between a coding result of a first image in the present invention and 
a coding result of another image obtained using a conventional 
method; 

25 FIG. 10 is a comparison graph illustrating the difference 

between a coding result of a second image in the present invention 
and a coding result of another image obtained using a conventional 
method ; 

FIG. 11 is a block diagram showing the structure of a moving 
30 picture decoding apparatus in the embodiment of the present 
invention; and 

FIG. 12 is an illustration of an interpolation method 



- 12- 



concerning the variation of the embodiment. 

Best Mode for Carrying Out the Invention 

A moving picture coding apparatus and a moving picture 
5 decoding apparatus in the embodiment of the present invention will 
be described below with reference to figures. 

In video coding, the coding efficiency is increased by applying 
motion estimation and motion compensation in predictive coding. 
The motion estimation and compensation can be improved by 
10 reducing the differential remaining between the image data to be 
coded and the predictive image data. In particular, a 1/8 sub-pixel 
motion vector accuracy can further improve the coding efficiency. 

The present invention achieves an improved motion 
estimation and compensation without increasing the hardware 
15 complexity and the computational effort accordingly. This is 
because the present invention enables to only employ a 16-bit 
accuracy of intermediate calculation results for this purpose. 

FIG. 7 is a block diagram showing the configuration of the 
moving picture coding apparatus in this embodiment. 
20 This moving picture coding apparatus 100 includes: a 

substractor 110; a transform/quantization unit 120; an inverse 
quantization/inverse transform unit 130; an adder 135; a 
deblocking filter 137; a memory 140; a 16-bit operation 
interpolation filter 150; a motion compensation/prediction unit 160; 
25 a motion estimation unit 170; and an entropy coding unit 190. 

The subtractor 110 subtracts a prediction signal PS from an 
input signal IS indicating a moving picture and outputs the 
differential to the transform/quantization unit 120. 

The transform/quantization unit 120 obtains the differential 
30 from the subtractor 110 and performs coding processing of 
frequency transform (such as DCT transform) and quantization 
using the differential. After that, the transform/quantization unit 
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120 outputs the quantized transform coefficient QC that is the 
processing result to the entropy coding unit 190 and the inverse 
quantization/inverse transform unit 130. 

The inverse quantization/inverse transform unit 130 performs 
5 decoding processing of inverse quantization and inverse DCT 
transform using the quantized transform coefficient QC outputted 
from the transform/quantization unit 120. After that the inverse 
quantization/inverse transform unit 130 outputs the differential 
decoding signal DDS that is the processing result to the adder 135. 

10 The adder 135 adds the differential decoding signal DDS to 

the prediction signal PS obtained from the motion compensation 
prediction unit 160, and outputs the picture obtained as the result to 
the deblocking filter 137. 

The deblocking filter 137 removes the block distortion of the 

15 picture outputted from the adder 135, and stores the picture with no 
block distortion in the memory 140 as a reference picture. 

The 16-bit operation interpolation filter 150 extracts a 
reference picture from the memory 140 and performs 1/8 sub-pixel 
interpolation of the reference picture. 

20 The motion estimation unit 170 estimates a motion vector 

based on the picture indicated by the input signal IS and the 
reference picture on which 1/8 sub-pixel interpolation has been 
performed using the 16-bit operation interpolation filter 150. After 
that, the motion estimation unit 170 outputs the motion data MD 

25 indicating the detected motion vector to the motion 
compensation/prediction unit 160 and the entropy coding unit 190. 

The motion compensation/prediction unit 160 performs 
motion compensation based on the motion vector indicated by the 
motion data MD and the reference picture on which 1/8 sub-pixel 

30 interpolation has been performed using the 16-bit operation 
interpolation filter 150. In this way, the motion 

compensation/prediction unit 160 predicts the current picture 
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indicated by the input signal IS and outputs the prediction signal PS 
indicating the prediction picture to the subtractor 110. 

The entropy coding unit 190 performs entropy coding of the 
quantized transform coefficients QC outputted by the 
5 transform/quantization unit 120 and the motion data MD outputted 
by the motion estimation unit 170, and outputs the result as a bit 
stream BS. 

The moving picture coding apparatus 100 in the embodiment 
like this has a feature of including a 16-bit operation interpolation 

10 filter 150. In other words, the motion compensation method in this 
embodiment has a feature that motion compensation is performed 
using the 1/8 sub-pixel interpolation by this 16-bit operation 
interpolation filter 150. 

Note that, in the moving picture coding apparatus 100 in this 

15 embodiment, the respective functional units other than the 16-bit 
operation interpolation filter 150 have the same functions as the 
respective functional units included in the above-described 
conventional moving picture coding apparatus. 

The 16-bit operation interpolation filter 150 calculates a 1/4 

20 sub-pixel value using a method other than a conventional method, 
and then calculates 1/8 sub-pixel value using the 1/4 sub-pixel 
value like in the case of the conventional method. The method how 
this 16-bit operation interpolation filter 150 calculates 1/4 sub-pixel 
value will be described in detail. 

25 A two-step procedure is employed for obtaining the 1/8 pixel 

accuracy. In a first stage including two interpolation steps, a 
horizontal and a vertical filtering is subsequently employed. For 
interpolating 1/4 sub-pixel values in the horizontal direction, the 
following Equations (11) to (13) are applied: 

30 
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h x =-\-A h + 3B h -\0C h + 59 D h +lS •£,-6-F A +l'G A -0.//,...(ll) 
h 2 = -lA h +4B h -\0C h +39-D h +39E h -10F h +4G h -\H h ..X\2) 
h, =-0-^+1.5, -6 Q +18- Z\ 4-59 ^ -10- F„ +3-G r l- //„... (13) 

In the above equations, hi to h3 represent the 1/4 sub-pixel 
values to be interpolated, and A x -H x represent the original full-pel 
5 pixel values. 

Here, the respective coefficients of A x -H x in this embodiment 
are set so that unnecessary high frequency components generated 
through interpolation are eliminated like in the conventional method. 
More specifically, the coefficients are set smaller than the 

10 conventional coefficients under the condition that picture quality 
does not deteriorate in the coding and decoding processing. In 
other words, the respective coefficients in this embodiment are set 
smaller in proportion to the respective coefficients of the 
conventional Equations (1) to (3). 

is After completing the horizontal filtering, the calculated values 

are rounded by being downshifted. For example, the intermediate 
value of hi is rounded using the following Equation (14). 



20 



( 64 ^ 

V ^ J 



»6...(14) 



Here, hi represents the interpolated pixel value resulting 
from horizontal filtering, and hi' represents the respectively 
downshifted pixel value. A corresponding processing is applied to 
all of the interpolated pixel values resulting from horizontal filtering. 
25 Note that w »6" in the Equation (14) represents 6-bit downshifting. 

In the second step of the first stage, the horizontally 
increased sub-pixel accuracy is also obtained in the vertical 
direction. For this purpose, a vertical filtering is applied. The 
previously performed downshift operation provides that none of the 
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intermediate calculations exceeds a 16-bit accuracy in the vertical 
filtering step. The vertical filtering is performed by employing the 
filter coefficients as shown in the following Equations (15) to (17) 
which correspond to Equations (11) to (13) in the case of the 
5 horizontal filtering: 



v, = -1.^3 +3-^-10-^+59.^+18-^ -6 D v+2 -hl A + 3-0 Z) v+4 ...(l5) 
v 2 =M-i>^+4.Z>^ 2 -10-^ 

v 3 =-0- D v _ 3 +1- D v _ 2 -6- D v _ } +18- D v + 59- D v+l -\0 D v+2 +3- D v+3 -1 -Z) v+4 ...(17) 



Here, Vi to v 3 refer to the 1/4 sub-pixel values in the vertical 
10 direction and D v -3, D v -2, D v -i, D V/ D v +i, D v +2, D v +3 and D v +4, represent 

the full-pel pixels in the vertical direction. In other words, the 

full-pel pixels are pixels 210 and 220 from FIG. 3. 

Here, the respective coefficients of D x (D v _3 to D v+4 ) in this 

embodiment are set smaller in proportion to the respective 
15 coefficients of the conventional Equations (4) to (6) like in the case 

of the respective coefficients of the above A x — H x . 

The calculation results from the vertical filtering, namely, 

pixel values 230, are subjected to downshifting by applying the 

following Equation (18) which is illustrated as an example case of Vi 
20 only: 



64 

v/=[ v, + — 
2 j 



»6...(18) 



A rounding during the downshift operation is achieved by 
adding the value 2 6 /2 = 64/2 to the interpolated pixel value. 
25 Although, the above description firstly applies a horizontal 

filtering and secondly a vertical filtering together with respective 
downshift operations, a skilled person in the art is aware that the 
horizontal and vertical operations may be exchanged to achieve the 
same result. Thus the vertical filtering may be performed before a 
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horizontal filtering. 

The finally obtained sub-pixel values with a 1/4 sub-pixel 
accuracy are clipped in order to fall within a range between 0 and 
255. 

5 The obtained 1/4 sub-pixel values are subjected to a bilinear 

filtering as it has been described above in connection with FIG. 5 and 
FIG. 6 in order to obtain a 1/8 sub-pixel resolution. 

The following example demonstrates that the processing of 
the present invention does not require any registers for 
10 intermediate pixel values exceeding a 16-bit accuracy. 

Assuming that a pixel value falls in the range between 0 and 
255, the largest possible values during a horizontal 8-tap filtering 
may occur when employing the following Equation (19) for 
calculating intermediate pixel value h 2 : 

15 

h 2 =-1-0 + 4- 255 + (-10) -0 + 39-255 + 39- 255 + (-10)-0 + 4 -255 -1-0.. .(19) 
h 2 = 21930 < 32768 = 2 15 => 156/7 + lbit(sign)... (20) 



In this way, this embodiment can eliminate the necessity of 
performing the calculation over 16 bits in the calculation processing 
20 of 1/4 sub-pixel values. 

The resulting pixel value is downshifted as indicated by the 
following Equation (21): 



25 



64 

21930 + — 
2 



»6 = 343...(21) 

J 



The result of the downshift operation is clipped to the range of 
0 to 255. 

As demonstrated above, the required pixel accuracy for the 
largest possible values during the filtering operation does not 
30 exceed 16-bits. Although the above operation example has only 
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been calculated for the horizontal direction, corresponding 
coefficients are used for the vertical filtering and, thus, identical 
advantages are applied to the vertical filtering. 

The above example only relates to the 1/4 sub-pixel 
5 resolution calculation. The bilinear filtering for generating a 1/8 
sub-pixel resolution only requires a maximum accuracy of 10-bits. 
Thus, a maximum accuracy of 16-bits is sufficient for performing all 
calculations of the present invention. Accordingly, the motion 
estimation, motion compensation and the coding and decoding of 
10 data moving picture can be improved in a simple manner. 

FIG. 8 is a flow chart showing" the motion compensation 
operation performed by the moving picture coding apparatus 100 in 
the embodiment. 

First, the 16-bit operation interpolation filter 150 of the 
15 moving picture coding apparatus 100 calculates 1/4 sub-pixel values 
(base values which are bases of sub-pixel values) of the reference 
picture extracted from the memory 140 in the horizontal direction 
(S100). After that, the 16-bit operation interpolation filter 150 
performs downshifting of the pixel values obtained in Step 100, and 
20 rounds the pixel values (Step 102). 

Next, the 16-bit operation interpolation filter 150 calculates 
1/4 sub-pixel values in the vertical direction using the pixel values 
rounded in Step 102 (Step 104). After that, the 16-bit operation 
interpolation filter 150 performs downshifting of the pixel values 
25 obtained in Step 104 and rounds the pixel values (Step 106). 

Through the operation of Step 100 to Step 106 like this, 1/4 
sub-pixels of the reference picture are interpolated in the horizontal 
direction and the vertical direction. 

When 1/4 sub-pixels are interpolated, the 16-bit operation 
30 interpolation filter 150 calculates 1/8 sub-pixels by performing 
bilinear filtering using the interpolated 1/4 sub-pixels like in the 
conventional case, in other words, the 16-bit operation interpolation 
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filter 150 raises the pixel accuracy of the reference picture from 1/4 
sub-pixel accuracy to 1/8 sub-pixel accuracy (Step 108). 

Through Step 100 to Step 108 performed by the 16-bit 
operation interpolation filter 150 like this, a reference picture with 
5 interpolated 1/8 sub-pixel values is generated. 

After that, the motion compensation/prediction unit 160 
performs motion compensation using the reference picture with 
interpolated 1/8 sub-pixels and outputs the prediction signal PS 
indicating the result (Step 110). 

io For demonstrating that similar results compared to 

conventional interpolation implementations can be achieved when 
applying the present invention, the algorithm of the present 
invention has been implemented into the H. 264/MPEG encoder 
processing software (JM4.2). The calculation results are illustrated 

15 in FIG. 9 and FIG. 10 by rate distortion curves indicating the impact 
on the perceived picture quality. Both figures differ only because 
different image sequences are employed as examples. 

The rate distortion curves of FIG. 9 and FIG. 10 are shown 
over the bit rate on the X-axis and the peak signal to noise ratio 

20 (PSNR) on the Y-axis representing a measure for the introduced 
distortions. 

FIG. 9 and FIG. 10 demonstrate that the 16-bit 
implementation of a 1/8 sub-pixel filter (1/8-pel 16 bit) does not 
result in an image quality degradation compared to the conventional 
25 JM4.2 algorithm (1/8-pel 25-bit) although the JM4.2 algorithm 
requires longer registers. In addition, the approach of the present 
invention actually performs better than 1/4 sub-pixel 20-bit coding 
(1/4-pel 20 bit). 

FIG. 11 is a block diagram showing the configuration of a 
30 moving picture decoding apparatus in the embodiment of the 
present invention. 

This moving picture decoding apparatus 300 includes: an 
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entropy decoding unit 310; an inverse quantization/inverse 
transform unit 320; an adder 330; a deblocking filter 340; a memory 
350 and a motion compensation unit 360. 

The entropy decoding unit 310 obtains a bit stream BS 
5 outputted by the moving picture coding apparatus 100 and performs 
entropy decoding processing of the bit stream. As the result, the 
entropy decoding unit 310 outputs the quantized transform 
coefficients QC to the inverse quantization/inverse transform unit 
320 and outputs the motion data MD indicating the motion vector to 

io the motion compensation unit 360. 

The inverse quantization/inverse transform unit 320 performs 
decoding processing of inverse quantization and inverse DCT 
transform using the quantized transform coefficients QC. After that, 
the inverse quantization/inverse transform unit 320 outputs the 

15 differential decoding signal DDS that is the result of the processing 
to the adder 330. 

The adder 330 adds the differential decoding signal DDS to 
the prediction signal PS obtained from the motion compensation unit 
360, and outputs the resulting picture to the deblocking filter 340. 

20 The deblocking filter 340 eliminates the block distortion of the 

picture outputted from the adder 330, and stores the picture with no 
block distortion to the memory 350. The decoded picture is 
extracted from the memory 350 as the output signal OS. 

The motion compensation unit 360 includes: a 16-bit 

25 operation interpolation filter 361 for extracting the picture stored in 
the memory 350 as a reference picture and performing 1/8 sub-pixel 
interpolation of the reference picture; and a motion compensation 
prediction unit 361 for predicting the current picture. This motion 
compensation prediction unit 361 performs motion compensation 

30 based on the motion vector indicated by the motion data MD and the 
reference picture on which 1/8 sub-pixel interpolation is performed 
using a 16-bit operation interpolation filter 361. In this way, the 
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motion compensation/prediction unit 361 predicts the current 
picture and outputs the prediction signal PS indicating the prediction 
picture to the adder 330. 

The moving picture decoding apparatus 300 like this also has 
5 a feature of including a 16-bit operation interpolation filter 361 like 
in the case of the moving picture coding apparatus 100. This 16-bit 
operation interpolation filter 361 has the same function as the 16-bit 
operation interpolation filter 150 of the moving picture coding 
apparatus 100. Therefore, with this moving picture decoding 

io apparatus 300, it is possible to reduce operation workload and 
simplify a hardware configuration without using pixel values 
exceeding 16 bits in the process of calculating the pixel values. 

Summarizing, the present invention provides an improved 
motion estimation and compensation by only employing a simplified 

is hardware configuration and less computational effort. This is 
achieved by employing particular filter coefficients and additional 
downshift operations when obtaining a 1/4 sub-pixel resolution 
image. Accordingly, a more efficient coding and decoding with a 
simpler hardware configuration can be achieved. 

20 

(Variation) 

Here, an variation of the method for interpolating 1/4 
sub-pixel values in the embodiment will be described below. 

In the above-described embodiment, a two-step interpolation 

25 is performed in the following way: 1/4 sub-pixel values are 
interpolated in the horizontal direction; and then other 1/4 sub-pixel 
values are interpolated in the vertical direction. However, a 
single-step interpolation is performed instead of the two-step 
interpolation in this variation, the single-step interpolation being 

30 able to achieve the same effect as the effect obtained through both 
the interpolation in the horizontal direction and the vertical direction. 
In other words, the 16-bit operation interpolation filter 150 of this 
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variation has a function as a two-dimensional filter. 

FIG. 12 is an illustration of an interpolation method 
concerning the variation of the embodiment. 

In this FIG. 12, white circles show pixels of full pixel unit that 
are present in a reference picture, and the pixel values of the pixels 
that are present in the horizontal position h and the vertical position 
v are represented as P h , v - Also, the number of taps of the 
two-dimensional filter is 36 (6 taps in both the horizontal direction 
and the vertical direction). 

In this case, the 16-bit operation interpolation filter 150 
calculates pixel values P hV/ u 0, j = 0 to 3, excluding "i = 0 and j = 0") 
of sub-pixels to be interpolated using the following Equation (22). 
Here, dj (m, n) is a filter coefficients (m, n = —2 to 3) and generally 
vary depending on the position (i, j) of the pixel to be interpolated. 
After that, the sub-pixel values calculated in this way are 
downshifted. 

3 3 

Phvjj = SX c >/(' w ' w ) P ^v-„---(22) 

m=-2n=-2 

20 In this variation like the case of the conventional example, 

those calculated sub-pixel values are always rounded and not used 
for calculating the pixel values of other sub-pixels. Thus, it is 
possible to reduce the number of bits necessary for the calculation 
process of sub-pixels. 

25 Although only an exemplary embodiment of this invention has 

been described in detail above, those skilled in the art will readily 
appreciate that many modifications are possible in the exemplary 
embodiment without materially departing from the novel teachings 
and advantages of this invention. Accordingly, all such 

30 modifications are intended to be included within the scope of this 
invention. 
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Industrial Applicability 

The motion compensation method concerning the present 
invention provides the following two effects that: operation 
5 workload can be reduced; and a hardware configuration can be 
simplified. For example, the motion compensation method can be 
applied for a moving picture coding apparatus for coding a moving 
picture, a moving picture decoding apparatus for decoding the coded 
moving picture, and the like. 
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